Norwegian Speech Recognition for Telephone Applications
نویسندگان
چکیده
In this paper we present a Norwegian tele phone speech database TABU We discuss the database design speci cation and some ex periences with recording and labelling of the database We also present some preliminary re sults with a word based recogniser trained on a subset of the database
منابع مشابه
Norwegian numerals: a challenge to automatic speech recognition
This paper addresses the problem of speaker-independent connected numeral recognition over telephone lines. Increasing the vocabulary from digits (0-9) to numerals (0-99) opens for more user-friendly services, but it also introduces many new, language-specific problems. This paper investigates morphological, phonemic and allophonic variations in the pronunciation of numerals in Norwegian. If im...
متن کاملImproving Performance of Telephone- Based Mandarin Speech Recognition
Since telephone is the only ubiquitous communications device in current world, it is the largest potential application field for speech techniques. Telephony speech recognition is a core technique for such telephone-based speech applications. It is well known that the bandwidth of telephone line is limited to 300~3400Hz and there are many inherent variations within the telephone network. All th...
متن کاملA Comparative Study of Gender and Age Classification in Speech Signals
Accurate gender classification is useful in speech and speaker recognition as well as speech emotion classification, because a better performance has been reported when separate acoustic models are employed for males and females. Gender classification is also apparent in face recognition, video summarization, human-robot interaction, etc. Although gender classification is rather mature in a...
متن کاملAutomatic Classification and Transcription of Telephone Speech in Radio Broadcast Data
Automatic transcription of telephone speech involves additional challenges compared to wideband data processing, mainly due to channel limitations and to particular characteristics of conversational telephone speech. While in TV speech recognition applications, such as automatic transcription of broadcast news, the presence of telephone data is nearly insignificant (less than 1 %), in most radi...
متن کاملPOLYCOST: A telephone-speech database for speaker recognition
This article presents an overview of the POLYCOST database dedicated to speaker recognition applications over the telephone network. The main characteristics of this database are: large mixed speech corpus size (> 100 speakers), English spoken by foreigners, mainly digits with some free speech, collected through international telephone lines, and more than eight sessions per speaker.
متن کامل